Use of Controlled Vocabularies to Improve Biomedical Information Retrieval Tasks

نویسندگان

  • Emilie Pasche
  • Julien Gobeill
  • Dina Vishnyakova
  • Patrick Ruch
  • Christian Lovis
چکیده

The high heterogeneity of biomedical vocabulary is a major obstacle for information retrieval in large biomedical collections. Therefore, using biomedical controlled vocabularies is crucial for managing these contents. We investigate the impact of query expansion based on controlled vocabularies to improve the effectiveness of two search engines. Our strategy relies on the enrichment of users' queries with additional terms, directly derived from such vocabularies applied to infectious diseases and chemical patents. We observed that query expansion based on pathogen names resulted in improvements of the top-precision of our first search engine, while the normalization of diseases degraded the top-precision. The expansion of chemical entities, which was performed on the second search engine, positively affected the mean average precision. We have shown that query expansion of some types of biomedical entities has a great potential to improve search effectiveness; therefore a fine-tuning of query expansion strategies could help improving the performances of search engines.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Utilization of Ontology Look-Up Services in Information Retrieval for Biomedical Literature

With the vast amount of biomedical data we face the necessity to improve information retrieval processes in biomedical domain. The use of biomedical ontologies facilitated the combination of various data sources (e.g. scientific literature, clinical data repository) by increasing the quality of information retrieval and reducing the maintenance efforts. In this context, we developed Ontology Lo...

متن کامل

MeSH Up: effective MeSH text classification for improved document retrieval

MOTIVATION Controlled vocabularies such as the Medical Subject Headings (MeSH) thesaurus and the Gene Ontology (GO) provide an efficient way of accessing and organizing biomedical information by reducing the ambiguity inherent to free-text data. Different methods of automating the assignment of MeSH concepts have been proposed to replace manual annotation, but they are either limited to a small...

متن کامل

First Participation of University and Hospitals of Geneva to Domain-Specific Track in CLEF 2008

We participate in 2008 to our first Domain-Specific Track, with the aim to establish a baseline for our Information Retrieval engine in an unknown domain for us. We are specialized in Natural Language Processing in the biomedical domain, and we participate to the medical Image track and to TREC Genomics for four years with textual strategies, as queries expansions with controlled vocabularies, ...

متن کامل

Biomedical ontologies.

Biomedical ontologies provide an organizational framework of the concepts involved in biological entities and processes in a system of hierarchical and associative relations that allows reasoning about biomedical knowledge. In contrast, biomedical terminologies promote a standard way of naming these concepts. Differences among various kinds of terminological systems can be briefly summarized as...

متن کامل

Medical Informatics: Concepts and Applications

Medical Informatics is a developing body of knowledge concerned with the use of information and communication technology in support of medical research, education and also for promoting health care delivery. The field focuses on the biomedical information, patient data, and also acquisition, storage, retrieval and optimal use of information for problem solving and decision making. The goal of m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Studies in health technology and informatics

دوره 192  شماره 

صفحات  -

تاریخ انتشار 2013